
SEQUENCE LISTING 



<110> ^flfcglison, JSjJ™ 111 

Yen, Juinn-Chern 

<120> NUCLEIC ACID MOLECULES ENCODING SERINE PROTEASE 16, THE 
ENCODED PROTEINS AND METHODS BASED THEREON 

<130> 24745-1625 

<140> 10/612,466 
<141> 2003-07-01 

<150> 60/394,347 
<151> 2002-07-02 

<160> 22 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 3147 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (23) ... (2589) 

<223> Nucleotide sequence encoding MTSPi 
<300> 

<308> GenBank #AR081724 
<309> 2000-08-31 

< 400> 1 . „„„ rrr«r* rnr Aflfl QQC QQa 52 



- K S? K Si SI 111 1% "1 IS SS 

1 5 " 

ggg ggc ccg aag gac ttc ggc gcg gga etc aag tac aac tec egg cac 
lly Gly Pro Lys Asp Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg Hxs 
15 20 

ss s: sa ss is a ss s: if? ss ss - s s: ?s s 

30 35 40 

aac ate aag aag gtg gaa aag cat ggc ccg ggg cgc tgg gtg gtg ctg 
Asn Val Lys Lys Val Glu Lys His Gly Pro Gly Arg Trp Val Val Leu 
45 50 55 

aca qcc qtg ctg ate ggc etc etc ttg gtc ttg ctg ggg ate ggc ttc 
Ala lit Va! Leu He Gly Leu Leu Leu Val Leu Leu Gly He Gly Phe 



60 



65 70 



ctg gtg tgg cat ttg cag tac egg gac gtg cgt gtc cag aag gtc ttc 
Leu va? Trp His Leu Gin Tyr Arg Asp Val Arg Val Gin Lys Val Phe 
75 80 85 



100 



148 



196 



244 



292 
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340 



ss is s a a s s ss a s s a s s si a 

95 100 

ss si si si s b s s = a s s s a si s 388 

110 115 



15? SI SI SI 22 SI IS S3 SI 52 2 i - - SI 
3 IS S K S3 SI IS SI SI SS IS S S3 SI IS SI 

»? s is is ss s ;s IS IS s ss S3 IS SS K is 

155 160 

S S3 S3 IS SS IS SI SS & SI SI SI SI SS IS 5 
S SI SI S SI fa! SS SI IS ?S S3 IS SI SI SI IS 

190 195 

S SI Sr ss ss s s? SS SI SI SI SI SI si ss SI 
SI IS 5 SS S! SS IS S? SI SI SI SI si is SI SI 

220 225 

g ,c , g c ccc tac ccc act cat ccc cgc go «. C. joe Jg « « 
Asp Ser Pro Tyr Pro Ala His Ala Arg cys wn f 2sq 
235 240 

IS IS SI IS SI SS SI SI SI SI 2S SI SI IS si is 

255 

S SI K SS SI SS SI IS IS S3 SI SI SI SI SI SS 
S SI SI SS SI IS SI SS S3 SS SS SI SS SI si si 

2 85 2 90 

III SI SI aS SS SI SI IS IS III SS SI S3 SS SI SI 

300 305 

£ 2 S SI SI s ss s ss IS SI SS SI SS IS si 



436 



484 



532 



580 



628 



676 



724 



772 



820 



868 



315 320 325 
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tte ttc eag ctg cct agg atg age age tgt gga ggc ego tta cgt aaa 1060 
Phe Phe Gin Leu Pro Arg Met Ser Ser Cys ^xy y s ^ 



335 340 



t-or cca acic cac tac cca ccc 
gee eag ggg aea tte aac age ccc tac tac cca gg ^ ^ prQ 

Ala Gin Gly Thr Phe Asn Ser Pro Tyr Tyr *<r y ^ 
350 355 

£ 3 || 3 S 3 K 3 53 £ S 3 H 3 3 
SI 3 3 3 3 3 3 3 2 3 S S 35 3 3 S 

380 385 

r, a r- t-»e ata qaq ate aat ggg gag aaa tac tgc 

ggc acc tgc ccc aag gac tac gtg gag ^ Lys ^ Cyg 

Gly Thr Cys Pro Lys Asp Tyr vai ^J-u xxc j 41Q 
395 400 

«-*-^ n+r* ni-r acc aqc aac age aac aag ate aca 
gga gag agg tee eag tte gtc gtc acc age ^ ^ Thr 

Gly Glu Arg Ser Gin Phe Val va± inr »^ 425 

e s 3 3 3 3 a s i s s s - Hi K 

is 3 i s - 3 s i K « ^ S Iff St? 3 s 

s s s I?? s 3 ;s s "? ss a s 3 s is s 

460 465 

e s 3 = s s = s « = 5 * s s K S 

s» 3 ss s 3 3 ss 3 3 SS 5S 3 3 3 s 

495 500 
gtc tgc gac agt gtg aac gac tgc gga gac aac age gac gag eag ggg 
Val Cys Asp Ser Val Asn Asp Cys Gly Asp Asn ser * p ^ 

510 51b 
t g c «t t g t c= g jc. «. acc tte J9| g tec .at i?? .39 tgc etc 
Cys Ser Cys Pro Ala Gin Thr Pne Arg ^y* ^ 5 
525 530 

S3 3 3 SS SS 3 S!? 3 3 3 3 1?? 3 SI? 3 
K 85 SS 3 3 SS 3 S3 =S 3 3 3 3 £ 3 SS 



1156 



1204 



1252 



1300 



1348 



1396 



1444 



1492 



1540 



1588 



1636 



1684 



1732 
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£ £ S £ £ £ I!? £ £ £ S 5? S - i 8S 

575 580 

£ 5 ?5 £ ^ £ 5 S g 1?? S S It? i? K - 

590 595 

K Si ??? a S S 52 1 S S2 If. 25 £ £ 15 15 
ffi £ S Sp it? 85 it? S - SI It? £ £ £ £ £ 
a 15 S ?5 £ £ £ 15 £ £ £ £ £ I" £ I 

635 640 

£ £ £ K in K £ £ £ £ 5 £ i5 £ I S 

655 660 

£ £ £ S it? £ S £ £ £ 15 £ £ £ it? £ 
£ £ £ K £ 15 £ it? It? £ £ £ £ £ £ "? 

685 690 

-*-^ ft-r aa t aac ttc acc ttc gac tat gac ate gcg ctg 
tec cac ccc ttc ttc aat gac «c ^ » Leu 

Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr *sp 

700 705 
ctg gag ctg gag aaa ccg gca gag tac age tec atg gtg egg ccc ate 
Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Mec vax g ^ 

720 

£ £ £ £ It? £ £ £ £ £ £ 15 £ ?t? £ £ 

735 740 

£ S if? £ it? ??? it? i - £ ??? it? £ "? 
£ Si? £ 15 it? £ £ £ £ £ it? S K £ ?t? £ 

765 770 

£ £ s a £ £ s ??? £ £ £ £ ?2 ?5 £ £ 

780 785 

K 15 15 £ £ I?? £ it? £ £ £ 15 it? ??? £ i 



2164 



2212 



2260 



2308 



2356 



2404 



2452 



795 800 
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age gtg gag 
Ser Val Glu 



gga gac ggc 
Gly Asp Gly 



cct ctg ttt 
Pro Leu Phe 
845 

ccacccaaat 
ggctggagac 
ctccagggct 
gggaggtaga 
agacacagcc 
cccctgtctg 
ggctgccgga 
aggaccctgg 
cttcagtgtg 
aaaaaaaa 



gcg gat ggg egg 
Ala Asp Gly Arg 
815 

tgc get cag agg 
Cys Ala Gin Arg 
830 

egg gac tgg ate 
Arg Asp Trp He 



2548 



ate ttc cag gee ggt gtg gtg age tgg 2500 
He Phe Gin Ala Gly Val Val Ser Trp 
820 825 

aac aag cca ggc gtg tac aca agg etc 
Asn Lys Pro Gly Val Tyr Thr Arg Leu 
835 840 

aaa gag aac act ggg gta ta ggggccgggg 2599 
Lys Glu Asn Thr Gly Val 



850 



gtgtacacct 
tggaccgctg 
ccaaatctgc 
aggggaggac 
tcccccgcca 
taaggagcag 
tctgggctgt 
aaaacagacg 
tgtatttgtg 



gcggggccac 
actgcaccag 
ctagaaaacc 
actggtggtt 
gccccaagct 
egggaaegga 
ggggcccttg 
ggtctgagac 
taaatgggta 



ccatcgtcca 
cgcccccaga 
tctcgcttcc 
ctactgaccc 
gggecgagge 
getteggage 
ggccacgctc 
tgaaattgtt 
aaacaattta 



855 

ccccagtgtg 
acatacactg 
tcagcctcca 
aactgggggc 
gcgtttgtgt 
ctcctcagtg 
ttgaggaagc 
ttaccagctc 
tttcttttta 



cacgcctgca 
tgaactcaat 
aagtggagct 
aaaggtttga 
atatctgect 
aaggtggtgg 
ccaggctcgg 
ccagggtgga 
aaaaaaaaaa 



2659 
2719 
2779 
2839 
2899 
2959 
3019 
3079 
3139 
3147 



<210> 2 

<211> 855 

<212> PRT 

<213> Homo Sapien 

<400> 2 

Met Gly Ser Asp Arg Ala 

1 ' 5 
Gly Ala Gly Leu Lys Tyr 
20 

Glu Glu Gly Val Glu Phe 
35 

Lys His Gly Pro Gly Arg 
50 

Leu Leu Leu Val Leu Leu 
65 70 
Tyr Arg Asp Val Arg Val 
85 

Thr Asn Glu Asn Phe Val 
100 

Phe Val Ser Leu Ala Ser 
115 

Ser Gly Val Pro Phe Leu 
130 

Ala Phe Ser Glu Gly Ser 
145 I 50 
He Pro Gin His Leu Val 
165 

Arg Val Val Met Leu Pro 
180 

Val Thr Ser Val Val Ala 
195 

Thr Gin Asp Asn Ser Cys 
210 

Leu Met Arg Phe Thr Thr 
225 230 
His Ala Arg Cys Gin Trp 



Arg Lys Gly Gly Gly Gly 
10 

Asn Ser Arg His Glu Lys 
25 

Leu Pro Val Asn Asn Val 
40 

Trp Val Val Leu Ala Ala 
55 60 
Gly He Gly Phe Leu Val 
75 

Gin Lys Val Phe Asn Gly 
90 

Asp Ala Tyr Glu Asn Ser 
105 

Lys Val Lys Asp Ala Leu 
120 

Gly Pro Tyr His Lys Glu 
135 140 
Val He Ala Tyr Tyr Trp 
155 

Glu Glu Ala Glu Arg Val 
170 

Pro Arg Ala Arg Ser Leu 
185 

Phe Pro Thr Asp Ser Lys 
200 

Ser Phe Gly Leu His Ala 
215 220 
Pro Gly Phe Pro Asp Ser 
235 

Ala Leu Arg Gly Asp Ala 
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Pro Lys Asp Phe 
15 

Val Asn Gly Leu 
30 

Lys Lys Val Glu 
45 

Val Leu He Gly 



Trp His Leu Gin 
80 

Tyr Met Arg He 
95 

Asn Ser Thr Glu 
110 

Lys Leu Leu Tyr 
125 

Ser Ala Val Thr 

Ser Glu Phe Ser 
160 

Met Ala Glu Glu 
175 

Lys Ser Phe Val 
190 

Thr Val Gin Arg 
205 

Arg Gly Val Glu 

Pro Tyr Pro Ala 
240 

Asp Ser Val Leu 



250 



255 



ser Leu Thr Phe Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly 
Ser Asp Leu III Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His 
Ala Leu Val Gin Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr 
Phe His Ser Ser Gin Asn Val Leu Leu lie Thr Leu He Thr Asn Thr 
III Arg Arg His Pro lly Phe Glu Ala Thr Phe Phe Gin Leu Pro Arg 
„et Ser Ser Cys lly Gly Arg Leu Arg Lys Ala Gin Gly Thr Phe Asn 
Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro Asn He Ajp Cys Thr Trp 
Asn He Glu Val Pro Asn Asn Gin His Val Lys Val Ser Phe Lys Phe 
Phe Tyr Leu Leu Glu Pro G ? ly val Pro Ala Gly Thr Cys Pro Lys Asp 
%l val Glu lie Asn lly Glu Lys Tyr Cys Gly Glu Arg Ser Gin Phe 
Val Val Thr Ser Asn Ser Asn Lys lie Thr Val Arg Phe His Ser Asp 
Oln Ser Tyr ill Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp 
Ser Ser asp Pro Cys Pro Gly Sin Phe Thr Cys Arg Thr Gly Arg Cys 
He Arg Lys Glu Leu Arg Cys Asp Gly Trp Ala Asp Cys Thr Asp His 
HI Asp Glu Leu Asn Cys Ser Cys Asp Ala Gly His Gin Phe Thr Cys 
Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp Val Cys Asp Ser Val Asn 
ASP cys Gly Asp Asn Ser Asp Glu Gin Gly Cys Ser Cys Pro Ala Gin 



Thr Phe III Cys Ser Asn Gly Lys Cys Leu Ser Lys Ser Gin Gin Cys 



A sn Gly Lys Asp Asp Cys lly Asp Gly Ser Asp Glu Ala Ser Cys Pro 
lys val Asn Val Val Thr Cys Thr Lys His Thr Tyr Arg Cys Leu Asn 
Gly Leu Cys Leu III Lys Gly Asn Pro Glu Cys Asp Gly Lys Glu Asp 
Cys Ser Asp lly Ser Asp Glu Lys Asp Cys Asp Cys Gly Leu Arg Ser 
Phe Thr III Gin Ala Arg Val Val Gly Gly Thr Asp Ala Asp Glu Gly 
Glu Trp Pro Trp Gin Val til Leu His Ala Leu Gly Gin Gly His lie 
%l Gly Ala ser Leu He Ser Pro Asn Trp Leu Val Ser Ala Ala His 
Cys Tyr He Asp t£ Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gin Trp 



Thr Ala Phe Leu Gly Leu His Asp G°ln Ser Gin Arg Ser Ala Pro Gly 



665 



680 



685 



Val Gin III Arg Arg Leu Lys Arg He He Ser His Pro Phe Phe Asn 

ASP Pne Thr Phe Asp Tyr HI He Ala Leu Leu Glu Leu Glu Lys Pro 

III Glu Tyr Ser Ser Met Val Arg Pro He Cys Leu Pro Asp Ala Ser 
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His Val Phe Pro SI Oly Lys Ala lie Trp Val Thr Oly Trp Gly His 
Thr Gin Tyr lly Gly Thr Gly Ala Leu He Leu Gin Lys Gly Glu He 
Arg val lie Asn Gin Thr Thr Cys Glu Asn Leu Leu Pro Gin Gin He 



730 735 



775 



Thr III Arg Met Met Cys Val Gly Phe Leu ser Gly Gly Val Asp Ser 



Cys Gin Gly Asp ser lly Gly Pro Leu Ser Ser Val Glu Ala Ajp Gly 
Arg lie Phe Gin til Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gin 
Arg Asn Lys Pro Gly Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp 

835 840 
He Lys Glu Asn Thr Gly Val 
850 855 



<210> 3 

<211> 3147 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

'AIT, ^c"ic-,c 2 ia S0 S U.„ce of protease do„aio o£ MTSP1 

s« «*=»»*■ cc,t gggg . g o^ogoec • SS > S > sssss: ••, 

gaaggacttc ggcgcgggac tcaagtacaa ct °«ggcac g a a a at gg CC cggg 180 

sis sis Iri fill isi ssi s 

catgaggatc acaaatgaga attttgtgga tgcccacgag _ tatacaqcq gagtcccatt 420 
tgtaagcctg gccagcaagg tgaaggacgc 9^gaagctg Jtgtacagcg 9J9 ^ 

= "iill 1 SS 3SSS S! 

iliiillilliil 

cgacctggtg acggtgtaca acaccctgag ^ccatggag » acgtcctgct 960 

sssss sees ™ : sbss »» 

E523 SSSS S3SS2 « | I I I tgjgtj- x». 

caacaaccag catgtgaagg tgagcttcaa attcttctac ^gctggagc ccggcg g 
tgcgggcacc tgccccaagg actacgtgga gatcaatggg 9agaaatact |cggagag g 

lH is m li if i Hi i 

2KSS SS5S SS5SSS SEES ^ 
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teat Qtt att ggg ggc acg gat gcg gat gag ggc gag tgg ccc tgg cag 
9 Vai Vai Gly G?y Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin 
X 5 10 

cat act eta age cag ggc cac ate tgc ggt get tec etc 
SX sir Leu His lit Leu Sy Gin Sy His He Cys Gly Ala Ser Leu 
20 25 

atc tct ccc aac tgg ctg gtc tct gee gca cac tgc tac ate gat gac 
111 Ser Pro Asn Tr? Leu Vai Ser Ala Ala His Cys Tyr lie Asp Asp 
35 40 

s si; s: s s ss k - £ ss s s s s ss 

50 55 60 

tta cac aac cag age cag cgc age gee cct ggg gtg cag gag cgc agg 
Leu His Asp III sir Gin Arg Ser Ala Pro Gly Vai Gin Glu Arg Arg 



etc aaa cgc ate ate tec cac ccc ttc ttc aat gac ttc acc ttc gac 
HI Lys Arg He He Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp 



80 



85 



hat aac ate aca ctg ctg gag ctg gag aaa ccg gca gag tac age tec 
Tyr Asp Til HI HI 111 Leu Glu Lys Pro Ala Glu Tyr Ser Ser 

100 105 



ata ata caa ccc ate tgc ctg ccg gac gec tec cat gtc ttc cct gee 
Me? vai Arg Pro He Cys Leu Pro Asp Ala Ser His Vai Phe Pro Ala 
115 120 

a „ ___ 4- aa a tc acq qqc tgg gga cac acc cag tat gga ggc 

ITy lyl He Tr? Va? Thr Sy Tr P c!y His Thr Gin Tyr Gly Gly 

130 135 

act ggc gcg ctg atc ctg caa aag ggt gag atc cgc gtc ate aac cag 
Thr Gly Ala Leu He Leu Gin Lys Gly Glu He Arg Vai He Asn Gin 
145 150 155 

acc acc tgc gag aac etc ctg ccg cag cag atc acg ccg cgc atg atg 
Thr Thr Cys 111 Asn Leu Leu Pro Gin Gin He Thr Pro Arg Met Met 
160 165 170 

tgc gtg ggc ttc etc age ggc ggc gtg gac tec tgc cag ggt gat tec 
Cys va! Gly Phe Leu Ser Gly Gly Vai Asp Ser Cys Gin Gly Asp Ser 
y 180 IBS 190 

ggg gga ccc ctg tee age gtg gag gcg gat ggg egg ate ttc cag gee 
Gly Gly Pro Leu Ser Ser Vai Glu Ala Asp Gly Arg He Phe Gin Ala 
1 1 200 205 



1909 



1957 



2005 



2053 



2101 



2149 



2197 



2245 



2293 



2341 



195 



orrri- ata aac tQQ aaa gac ggc tgc get cag agg aac aag cca ggc 

IS S Vai Ser Tr? G?y Lp Sy Cys Ala Gin Arg Asn Lys Pro Gly 
210 215 220 

ata tac aca agg etc cct ctg ttt egg gac tgg atc aaa gag aac act 
Vai ^r Thr Arg Leu Pro Leu Phe Arg Asp Trp He Lys Glu Asn Thr 
225 " 230 235 

Page 8 



2389 



2437 



2485 



2533 



2581 



2630 



ggg gta tag gggccggggc cacccaaatg tgtacacctg cggggccacc 

Gly Val * 
240 

catcgtccac cccgtgtgc acgcctge.g |ctggag«ct |9»«9«g» "gc.cc.gc 26*0 

SSS3S ssess ii? b b ESSE 2 ,"°o 

tactgaccca actgggggca aaggtttgaa gacacagccc cccc g a g 293 o 

SEES MS SS3SS i | Hi 

SSSSS 23SSK SSSSSE KSSS SSS5S a ~«S£- »» 

aacaatttat ttctttttaa aaaaaaaaaa aaaaaaa 



<210> 4 

<211> 241 

<212> PRT 

<213> Homo Sapien 



vl?°v«l Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin val 
Ser Leu His Ala Leu Gly Gin Gly His He Cys Gly Ala Ser Leu He 
Ser Pro Asn Trp Leu Val Ser Ala Ala His Cys Tyr lie Asp Asp Arg 
Gly Phe irg Tyr Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly Leu 
His Asp Gin Ser Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu 
Us Arg He He Ser lis Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr 
Asp He Ala Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met 



val Arg Pro He Cys Leu Pro Asp HI Ser His Val Phe Pro Ala Gly 



120 



Lys Ala He Trp Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr 



135 



Gly Ala Leu He Leu Gin Lys Gly Glu He Arg Val He Asn Gin Thr 



Th r Cys Glu Asn Leu Leu Pro Gin Gin lie Thr Pro Arg Met Met Cys 
Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly 



Gly Pro Leu III Ser Val Glu Ala As P Gly Arg He Phe Gin Ala Gly 



185 



Val Val Ser Trp Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val 



200 



Tyr Thr Arg Leu Pro Leu HI Arg Asp Trp lie Lys Glu Asn Thr Gly 



220 



225 230 
Val 



235 



<210> 5 
<211> 2293 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (2259) 
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<223> CVSP16 Full Length cDNA 

<400> 5 ^ tg rtt ate ate agt ccc 48 

Tal 
10 



288 



fir, 21 a a a a a a a a a a s s a 
s a ss ss a a ss a ss a if, a a a ss ss 
JS I" SS 2 K a ss s a ss a a ss s a ?s 

35 40 

1!? IS S =S S S a 1!? S S S S SS ?3 S 2 
SE IS SS II? K S a if? IS i 2 ?S Si £ 2 

S ?S £ ser SS E K S - Met ffi SS SS S 2 SS 

85 90 

«- « i-^rr ni- a eta eta qqc gtg cac tec cag gac ggg 

S S SS 85 SS 2 SS 2 a S!y «? « iS S„ al. »»P Gly 
100 105 

III a 2 SI? IS £ £ 5 SS ?2 SS SS 2 SS S3 2 

115 120 

is ss a a ffi a ss a ss ss ss a s a a a 
ss ss a a ss a a ss a ss a a a a a a 

145 150 155 

is s ss a a a a a a ss a ss a a ss a 

165 17U 

is a ss a a a a ss ss a a a a a a ?s 

180 185 

a a ss a a a a a a ss ss ss a a a a 

195 200 

ss a a a a ss a a a a a a a a a a 



96 



144 



192 



336 



384 



432 



480 



528 



576 



624 



210 215 



ggg atg ctg tgt get gge tae eea ggg ggc ege agg gae aee tge eag 
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672 



720 



Oly Mt K. Cys Al, Gly Tyr Pro Gly Gly Arg Arg Asp Thr cys Gin 

«S 15 S SS M? 25 S 2! 55 SS SS SS SS 5 1 S2 

245 250 

SS S SS J2 S S 25 85 52 SS §5 K S S - 5 

260 265 

~<-<r act act ata act acc tat gag gca tgg ata egg gag 

p C ro lly ?S P^e Thr fla fa! L. Thr Tyr Glu Ala Trp lie Arg Glu 

275 280 

SS S3 52 SS 55? S £ 35 - E 52 52 25 SS 52 S2 

290 295 

S S IS SS 12 25 52 SS 5 SS SS 2? SS S |2 

305 3 10 315 

K S3 52 IS 55 85 21 K S S - 15 25 S f» 55 

325 330 

SS K SS 52 2! S3 52 15 SS S 52 55 SS 15 K 21 
5! 555 IS S SI 25 21 K 52 E 25 55 52 52 12 551 
LS S 555 15 52 lit 555 15 12 25 12 25 55 2 9 2! 551 

37 5 o o u 

251 555 55? «?5 551 15 21 SS 15 21 H 15 52 2? SS 25 

385 3 90 395 

SS 155 12 55? Sf 12 21 155 15? 15 21 lit 21 52 SS 52 
a5 S 555 2? S5 55? 515 S 155 55? 15 25 2? 55 25 555 

42 0 42 5 

S5 55? SS 5*5 55 52 2? 555 SS 515 15 55 S 52 155 15 

435 440 445 

too acc cqc ggg gaa ccc gcg ctt ggc cca ggc gcg ctg ctg gag gcg 
Trp Gly Arg Gly Glu Pro 111 Leu Gly Pro Gly Ala Leu Leu Glu Ala 

r- 455 460 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



450 



gag ctg tta ggc ggc tgg tgg tgc cac tgc ctg tac ggc cgc cag ggg 
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1392 



1440 



Olu Leu Leu Gly Gly Trp Trp Cys His Cys Leu Tyr Gly Arg Gin Gly 
465 470 475 

s in s 2 2 s if; k ss 2 s s 2 2 2 s 

485 490 



s a is 2 is as sa sg 2 2 2 2 2 ss 2 2 



500 505 



r,*f + * a tac caa qag gag ggg acc tgg ttt ctg get gga ate aga 
sir Leu S Ss Oil Glu Glu S? Thr Trp Phe Leu Ala Gly He Arg 
515 520 



k s s ffi ss 3s 2 s 2 s s Hi ~ -° 25 s 

S 2 2 S 2 2 2 2 2 2 5 2 K SS 2 S! 
K S 2 E 2 K 2 2 S 2 SS SS SS 2 IS S 

ar-t tat ccc cca cac aca gag cat ggt gec tgt ggc ctg egg ctg 
Gin Thr Cys Pro Pro His Thr Glu His Gly Ala Cys Gly Leu Arg Leu 
580 585 

gag get get cca gtg ggg gtc ctg tgg ccc tgg ctg gca gag gtg cat 
l!2 Ala Ala Pro Val Gly Val Leu Trp Pro Trp Leu Ala Glu Val His 



595 



gtg get ggt gat ega gtc tgc act ggg ate etc ctg gee cca ggc tgg 
Va! La lly Asp Arg Val Cys Thr Gly He Leu Leu Ala Pro Gly Trp 



610 615 «0 



SS 2 2 lit 2 E SS 2 2 5 S 2 2 S 2 2 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



625 



630 " 635 640 

III £ 2 SS 2 2 2 Sg 2 S Ig K 2 SS 2 S 

645 650 



lg K S 2 S 2 1?; S S 2 2 2 S§ 2 S 

660 665 

IS ig 2 2 2 2 2 lg 2 2 2 SS lg 11? 2 IS 

675 680 

ctg ttt get gec att ggt cct gaa gag gec tgg ate tec cag aca gtg 
Leu Pne Ala Ala He lly Pro Glu Glu Ala Trp lie Ser Gin Thr Val 



695 700 



690 

gga gag gee aac ttc ctg ccc ccc agt ggc tec cca cac tgg ccc act 
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1968 



2016 



2064 



2112 



2160 



Gly Glu Ala Asn Phe Leu Pro Pro Ser Gly Ser Pro His Trp Pro Thr 
705 710 715 

gga ggc age aat etc tgc cec cea gaa etg gee aag gee teg gga tec 
Gly Gly Ser Asn Leu Cys Pro Pro Glu Leu Ala Lys Ala Ser Gly Ser 



725 



ccq cat gca gtc tac ttc etg etc ctg etg act etc ctg ate cag age 
III His SI Val Tyr Phe Leu Leu Leu Leu Thr Leu Leu lie Gin Ser 
740 745 750 

tga ggggctaggg tcccagcacc acttccccct tctc 



<210> 6 
<211> 752 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> CVSP16 Full Length Protein 



Met°Ala Arg Gin Leu Leu Leu Pro Leu Val Val Leu Val He Ser Pro 

iL Pro Gly Ala Phe Gin Asp Ser Ala Leu Ser Pro Thr Gin Glu Glu 

Pro Glu Asp Leu Asp Cys Gly Arg Pro Glu Pro Ser Ala Arg He Val 

35 40 45 

Gly Gly Ser Asn Ala Gin Pro Gly Thr Trp Pro Trp Gin Val Ser Leu 

55 60 



2208 



2256 



2293 



His HiB Gly Gly Gly His lie Cys Gly Gly Ser Leu lie Ala Pro Ser 

Kp Val Leu Ser Ala Ila His Cys Phe Met Thr Asn Gly Thr Leu Glu 

85 ^0 y 

Pro Ala Ala Glu Trp Ser Val Leu Leu Gly Val His Ser Gin Asp Gly 

100 105 
Pro Leu Asp Gly Ala His Thr Arg Ala Val Ala Ala lie Val Val Pro 

115 120 125 

Ala Asn Tyr Ser Gin Val Glu Leu Gly Ala Asp Leu Ala Leu Leu Arg 

L eu Ala Ser Pro Ala Ser Leu Gly Pro Ala Val Trp Pro Val Cys Leu 

150 155 j- o w 

Pro Arg Ala Ser His Arg Phe Val His Gly Thr Ala Cys Trp Ala Thr 

165 170 i/D 

Gly Trp Gly Asp Val Gin Glu Ala Asp Pro Leu Pro Leu Pro Trp Val 

Leu Gin Glu HI Glu Leu Arg Leu Leu Gly Glu Ala Thr Cys Gin Cys 

195 200 205 

Leu Tyr Ser Gin Pro Gly Pro Phe Asn Leu Thr Leu Gin lie Leu Pro 

Gly Met Leu Cys Ala Gly Tyr Pro Gly Gly Arg Arg Asp Thr Cys Gin 

Gly Asp Ser Gly Gly Pro Leu val Cys Glu Glu Gly Gly Arg Trp Phe 

245 250 
Gin Ala Gly He Thr Ser Phe Gly Phe Gly Cys Gly Arg Arg Asn Arg 

260 2 * 5 270 

Pro Gly Val Phe Thr Ala Val Ala Thr Tyr Glu Ala Trp He Arg Glu 
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275 280 285 

Gin Val Met Gly Ser Glu Pro Gly Pro Ala Phe Pro Thr Gin Pro Gin 

290 295 300 

Lvs Thr Gin Ser Asp Pro Gin Glu Pro Arg Glu Glu Asn Cys Thr lie 
305 310 315 _ 320 

Ala Leu Pro Glu Cys Gly Lys Ala Pro Arg Pro Gly Ala Trp Pro Trp 

325 330 335 

Glu Ala Gin Val Met Val Pro Gly Ser Arg Pro Cys His Gly Ala Leu 

340 345 350 

val Ser Glu Ser Trp Val Leu Ala Pro Ala Ser Cys Phe Leu Asp Pro 

355 360 365 

Asn Ser Ser Asp Ser Pro Pro Arg Asp Leu Asp Ala Trp Arg Val Leu 

370 375 380 

Leu Pro Ser His Pro Arg Ala Glu Arg Val Ala Arg Leu Val Gin His 
385 390 395 400 

Glu Asn Ala Ser Trp Asp Asn Ala Pro Asp Leu Ala Leu Leu Gin Leu 

405 ~ 410 415 

Arq Thr Pro Val Asn Leu Ser Ala Ala Ser Arg Pro Val Cys Leu Pro 

420 425 430 

His Pro Glu His Tyr Phe Leu Pro Gly Ser Arg Cys Arg Leu Ala Arg 

435 440 445 

Trp Gly Arg Gly Glu Pro Ala Leu Gly Pro Gly Ala Leu Leu Glu Ala 

450 ~ 455 460 

Glu Leu Leu Gly Gly Trp Trp Cys His Cys Leu Tyr Gly Arg Gin Gly 
465 470 475 480 

Ala Ala Val Pro Leu Pro Gly Asp Pro Pro His Ala Leu Cys Pro Ala 

485 490 495 

Tyr Gin Glu Lys Glu Glu Val Gly Ser Cys Trp Asn Asp Ser Arg Trp 

500 505 510 

Ser Leu Leu Cys Gin Glu Glu Gly Thr Trp Phe Leu Ala Gly He Arg 

515 520 525 

Asp Phe Pro Ser Gly Cys Leu Arg Pro Arg Ala Phe Phe Pro Leu Gin 

530 535 540 

Thr His Gly Pro Trp He Ser His Val Thr Arg Gly Ala Tyr Leu Glu 
545 550 555 560 

Asp Gin Leu Ala Trp Asp Trp Gly Pro Asp Gly Glu Glu Thr Glu Thr 

565 570 575 

Gin Thr Cys Pro Pro His Thr Glu His Gly Ala Cys Gly Leu Arg Leu 

580 585 590 

Glu Ala Ala Pro Val Gly Val Leu Trp Pro Trp Leu Ala Glu Val His 

595 600 605 

Val Ala Gly Asp Arg Val Cys Thr Gly He Leu Leu Ala Pro Gly Trp 

610 " 615 620 

Val Leu Ala Ala Thr His Cys Val Leu Arg Pro Gly Ser Thr Thr Val 
625 630 635 640 

Pro Tyr He Glu Val Tyr Leu Gly Arg Ala Gly Ala Ser Ser Leu Pro 

645 650 655 

Gin Gly His Gin Met Thr Ser Ala Pro Pro Leu Leu Cys Gin Met Thr 

660 665 670 

Glu Gly Ser Trp He Leu Val Gly Met Ala Val Gin Gly Ser Arg Glu 

675 680 685 

Leu Phe Ala Ala He Gly Pro Glu Glu Ala Trp He Ser Gin Thr Val 

690 695 700 

Gly Glu Ala Asn Phe Leu Pro Pro Ser Gly Ser Pro His Trp Pro Thr 
705 710 715 720 

Glv Gly Ser Asn Leu Cys Pro Pro Glu Leu Ala Lys Ala Ser Gly Ser 

725 730 735 

Pro His Ala Val Tyr Phe Leu Leu Leu Leu Thr Leu Leu He Gin Ser 
740 745 750 
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<210> 7 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 7 

ccctctgggt agccagcaca cagcatc 

<210> 8 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 8 

gccatcgtgg tgccggccaa ctacag 

<210> 9 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 9 

gcacacagca tccctggcaa tatctgg 

<210> 10 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 
<400> 10 

cggccaacta cagccaagtg gagctg 

<210> 11 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 11 

atggcccggc agctgctcct cccccttgtg 

<210> 12 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



26 



27 



26 



30 
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<220> 

<223> Primer 



<400> 12 28 
cggctcccgg gcaggaagta gtgttccg 

<210> 13 
<211> 9276 
<212> DNA 

<213> Pichia pastoris 



aaatctiaca tccaaagacg aaaggttgaa tgaaaccttt ttgccatccg acatccacag 60 
alccattctc acacatlagl gccaaacgca acaggagggg atacactagc agcagaccgt 120 
?qcaaacgca ggacctcclc tcctcttctc ctcaacaccc acttttgcca tcgaaaaacc 180 
alcccag?ta ??gggctt:ga ttggagctcg ctcattccaa "ccttctat ^aggctacta 240 
acaccatqac tttattagcc tgtctatcct ggcccccctg gcgaggttca tgtttgttta 300 
Sccgaatg caacaagftc cgcattacac ccgaacatca ctccagatga 99gctttctg 360 
aqtgtggggt caaatagttt catgttcccc aaatggccca aaactgacag "taaacgct 420 
gtcttSSc ctaatatgac aaaagcgtga tctcatccaa gatgaactaa 3«tggttcg 480 
?taaaatqct aacggccagt tggtcaaaaa gaaacttcca aaagtcgcca taccgtttgt 540 
ct?qtttS aSglttgac gaatgctcaa aaataatctc attaatgctt agcgcagtct 600 
ctcKtcgS Kgaaclcc ggtgcacctg tgccgaaacg caaatgggga -cacccgct 660 

SUES SSSS SSSS 3ESS 

assa sass sssss ss=s "™ 

F a cffcbi SSSSS S5 E£SS SK2K !So 

^ ssss ss=2 5222 iigj s 

Ittatlaata ctactattgc cagcattgct gctaaagaag aaggggtatc tctegagaaa 1200 
agagaXKg aagctt.cgt aglattcoot agggeggccg ega.ttaatt JSCcctagac 1260 
;?g!«gttl cwagttcla gttgggcact tacg.gaag. "ggtcttgc "g.ttctaa 1320 

SSSK SSSSS SSSS U 

bsSS 3£S2£ USSSS £25^ 

Is aclsgfcig sssu: ssss sssss 

S SS5S S53K 22SSS S« 
bHi 5« SSSS2S KSSS ? 

coafcacacc cgtcctgtgg atctatcgaa tctaaatgta agttaaaatc tctaaataat 1980 
taaataagtc ccagtt?c?I catacgaacc ttaacagcat tgcggtgagc atctagacct 2040 

l ag SS5=S "ggfaggli 3S i | 

b lis 2s=2 sssss sssss sees | 

Issss sssss SSSKS 25SS2 

cttaqcaact gcacagcttg gttgtgaaca gcatcttcaa ttctggcaag ctccttgtct 2460 
qtca?atcga cagccaacai latcacctgg gaatcaatac catgttcagc ttgagacaga 2520 
agqtctgagg caacgaaatl tggatcagcg tatttatcag caataactag aacttcagaa 2580 
olcccalclg gcatgtcaat actacacagg gctgatgtgt cattttgaac catcatcttg 2640 
glagcagtaa Igaaltggtt tcctggacca aatattttgt cacacttagg aacagtttct 2700 
gttccglaag ccatagclgc tactgcctgg gcgcctcctg ctagcacgat ^cacttagca 2760 
fcaaccttgf gggcaacgta gatgacttct ggggtaaggg taccatcctt cttaggtgga 2820 
gatgcaaaaa SKtttcttt gcaaccagca actttggcag gaacacccag catcagggaa 2880 
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gtggaaggca 
gcaaaacgag 
gcttcatgga 
ggcaattgca 
ccatcaaact 
acaattggtt 
tcttcaattt 
tcagaaggga 
ttggcatctc 
tcgtccaacg 
cattcccagg 
attttagcgt 
gcattgctct 
aaacaggaag 
aagcagtctc 
gcacctttat 
gcctccggaa 
gccaccaaag 
tcactgacag 
gcatcaataa 
gtctcagaaa 
agtggagaag 
accagggtcc 
aaatctaggt 
atcagctcct 
tcagtcgatt 
gcttttccta 
aagggaaatg 
tatttttatt 
gacctgcagg 
taccaggcct 
gctttgttgt 
gcgttgtcgg 
caaagccgcc 
attctgatta 
tatcaatacc 
agttccatag 
tacaacctat 
tgacgactga 
caggccagcc 
gtgattgcgc 
gaatcgaatg 
caggatattc 
atgcatcatc 
gccagtttag 
tcagaaacaa 
gcccgacatt 
atcgcggcct 
tgtttatgta 
aacatcagag 
ccggcgccac 
gggctcgcca 
tggccggggg 
tcaacggcct 
gtcgagtatc 
ggtctcctat 
tctctgactt 
aaatgtcctt 
gaacaaactt 
tgtcgggtag 



gaattgcggt 
agcagactac 
atttcctgac 
taagttcctc 
tggcagttag 
tgactaattc 
cttgtgagga 
cttctttagg 
ctttccttct 
tcacaccgta 
ctatatcttc 
tcaacaaaac 
tacgatccca 
tgcgttccaa 
catcacaatc 
accacaaacc 
tagacttttt 
tagtgaatag 
ggaacttttt 
tggggattat 
aagcataaac 
aaaaaggcac 
tatagataac 
ccaaaatcac 
caaattggtc 
gagtgaactt 
ccaaactcaa 
tcatacttga 
atcagtgagt 
gggggggggg 
gaatcgcccc 
aggtggacca 
gaagatgcgt 
gtcccgtcaa 
gaaaaactca 
atatttttga 
gatggcaaga 
taatttcccc 
atccggtgag 
attacgctcg 
ctgagcgaga 
caaccggcgc 
ttctaatacc 
aggagtacgg 
tctgaccatc 
ctctggcgca 
atcgcgagcc 
cgagcaagac 
agcagacagt 
attttgagac 
aggtgcggtt 
cttcgggctc 
actgttgggc 
caacctacta 
tatgattgga 
cagattatgc 
ttggtcatca 
cttggagaca 
cttgtttcga 
gaatggagcg 



tccaccagga 
accagggcaa 
gttatctata 
tgggaaagga 
ttctaaaagg 
cataatctgt 
ggccttagaa 
tttggattct 
agtgaccttt 
cttggcacat 
cttggattta 
ttcgtcgtca 
caaggtggct 
gtgacagaaa 
caattcgata 
gtgacgacga 
ggacgagtac 
accatcgggg 
gacatcttca 
accagaagca 
agttctacta 
agcgatacta 
cctagcgcct 
ttcattgata 
ctctgtaacg 
gatcaggttg 
ggaattatca 
agtcggacag 
cagtcatcag 
gcgctgaggt 
atcatccagc 
gttggtgatt 
gatctgatcc 
gtcagcgtaa 
tcgagcatca 
aaaagccgtt 
tcctggtatc 
tcgtcaaaaa 
aatggcaaaa 
tcatcaaaat 
cgaaatacgc 
aggaacactg 
tggaatgctg 
ataaaatgct 
tcatctgtaa 
tcgggcttcc 
catttatacc 
gtttcccgtt 
tttattgttc 
acaacgtggc 
gctggcgcct 
atgagcgctt 
gccatctcct 
ctgggctgct 
agtatgggaa 
ccaactaaag 
gtagactcga 
gtaaatgaag 
actttttcgg 
ggcaaatgct 



atatagaggc 
gtctcaactt 
gagagatcaa 
gcttctaaca 
gctttgtcac 
tccgttttct 
acgtcaattt 
tctttaggtt 
agggacttca 
ctaactaatg 
gcttctgcaa 
aataaccgtt 
tccatggctc 
ccaacacctg 
cccagcaact 
gattggtaga 
accaggccca 
cggtcagtag 
gaaagttcgt 
acagtggaag 
ccgccattag 
gcattagcgg 
gggatcatcc 
ccattattgt 
gatgactcaa 
tgcagctggt 
aactctgcaa 
tgagtgtagt 
gagatcctct 
ctgcctcgtg 
cagaaagtga 
ttgaactttt 
ttcaactcag 
tgctctgcca 
aatgaaactg 
tctgtaatga 
ggtctgcgat 
taaggttatc 
gcttatgcat 
cactcgcatc 
gatcgctgtt 
ccagcgcatc 
ttttcccggg 
tgatggtcgg 
catcattggc 
catacaatcg 
catataaatc 
gaatatggct 
atgatgatat 
tttccccccc 



atatcgccga 
gtttcggcgt 
tgcatgcacc 
tcctaatgca 
tggtgatacc 
caaccggagg 
actgtgagac 
tcccaccaat 
tgccttgaac 
taccttctgg 
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caactttctc 
gcaacgtctc 
tggctctctt 
caggtgtctt 
cattttgacg 
ggataggacg 
tgcacaattc 
gttccttggt 
tatccaggtt 
caaaataaaa 
gttcatcagc 
tggtataaga 
taagaccctt 
tttgttcaac 
tttgagttgc 
ctccagtttg 
acgagtaatt 
tcaaagacgc 
attcagtagt 
tcacatctac 
tgaaactttt 
gcaaggatgc 
tttggacaac 
acaacttgag 
cttgcacatt 
cagcagcata 
cacttgcgta 
cttgagaaat 
acgccggacg 
aagaaggtgt 
gggagccacg 
gctttgccac 
caaaagttcg 
gtgttacaac 
caatttattc 
aggagaaaac 
tccgactcgt 
aagtgagaaa 
ttctttccag 
aaccaaaccg 
aaaaggacaa 
aacaatattt 
gatcgcagtg 
aagaggcata 
aacgctacct 
atagattgtc 
agcatccatg 
cataacaccc 
atttttatct 
ccccctgcag 
catcaccgat 
gggtatggtg 
attccttgcg 
ggagtcgcat 
cgcattcttc 
aggagatttc 
tatctcggtt 
aaagaaatcc 
tataaaatgt 
accttcaaga 
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aataggtctt 
cgttagttga 
aacgttatct 
caaagcgact 
aacattgtcg 
acgaagggca 
aatacgacct 
gtatcctggc 
tctctccacc 
taagtcagca 
ttcctcccta 
accttctgga 
tgattggcca 
cacaaatttc 
tccagatgta 
tgtccttata 
agaagagtca 
caacaaaatt 
caattgccga 
caactttgcg 
caaatcgccc 
aactttatca 
tctttctgcc 
caagttgtcg 
aacttgaagc 
gggaaacacg 
tgcaggtagc 
tctgaagccg 
catcgtggcc 
tgctgactca 
gttgatgaga 
ggaacggtct 
atttattcaa 
caattaacca 
atatcaggat 
tcaccgaggc 
ccaacatcaa 
tcaccatgag 
acttgttcaa 
ttattcattc 
ttacaaacag 
tcacctgaat 
gtgagtaacc 
aattccgtca 
ttgccatgtt 
gcacctgatt 
ttggaattta 
cttgtattac 
tgtgcaatgt 
gtcggcatca 
ggggaagatc 
gcaggccccg 
gcggcggtgc 
aagggagagc 
agtgtcttga 
atggtaaatt 
atgacagcag 
ttgttatcag 
agagtggata 
ggtatgtagg 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 



otttgtagat actgatgcca acttcagtga caacgttgct atttcgttca aaccattccg 6540 
latccaglga aatlaalgtt gtttgtctac tattgatcca agccagtgcg S^ttgaaac 6600 
taacaataqt gtgctcgtgt tttgaggtca tctttgtatg aataaatcta gtctttgatc 6660 
S!taaKt Slcgagcca aggcgataaa tacccaaatc taaaactctt ttaaaacgtt 6720 
aaaaqgacaa g?atg?ctgc clgtlttaaa ccccaaatca gctcgtagtc tgatcctcat 6780 
ctacltgagg ggcactatct tgttttagag aaatttgcgg agatgcgata tcgagaaaaa 6840 
ggScgltgl ??ttaaacgt gaaatttatc tcaagatctc tgcctcgcgc g"tcggtga 6900 
tlacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 6960 
ggatgccggg agcagaclag cccgtcaggg cgcgtcagcg ggtgttggcg S^gtcgggg 7020 
Eqcaqccatg acccagtcac gtagcgatag cggagtgtat actggcttaa ctatgcggca 7080 
tcaalacaqa ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta 714 0 
aggagaaaat accgcatLg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 7200 
qtcqttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 72&u 
laatcagggg aLIcgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 7320 
cqtaaaaagg ccgcgftgc? ggcgtttttc cataggctcc gcccccctga cgagcatcac 73 80 
alaaatcqlc gc^caagtca glggtggcga aacccgacag gactataaag ataccaggcg 7440 
tttccccKg gaagctlcct cgtgcgctct cctgttccga ccctgccgct taccggatac 7500 
?tc?cccttc ggglagcgtg gcgctttctc aatgctcacg ctgtjggtat 7560 
ctcaqttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 7620 
cccqaccgc? qcgccltalc cggtaactat cgtcttgagt ccaacccggt aagacacgac 7680 
tta?cgc?ac ?ggcagcagc clctggtaac aggattagca gagcgaggta tgtaggcggt 7740 
gctacagagt tcltglagtg gtggcctaac tacggctaca ctagaaggac gtatttggt 7800 
atctacqctc tqctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 78 bu 
aaaclalcca ccqcLgtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 792 0 
aaaaaaqqa? ctSagaaga tcctEEgatc ttttctacgg ggtctgacgc tcagtggaac 7980 
aaaaaclcac qt?aagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 8040 
cttttaa^t aaaaatgaag ttttlaatcl atctaaagta tatatgagta aacttggtct 8100 
gacagt?acc aatgctLa? cagtgaggca cctatctcag cgatctgtct atttcgttca 8160 
Iccataqttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 8220 
qqcccclgtg cLcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 8280 
SaaacSg? cagccggaag ggclgagcgc agaagtggtc ctgcaacttt atccgcctcc 8340 
atccagtcla ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 8400 
cacaalqttq ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct 8460 
tcattclgc? ccggttccca algalcaagg cgagttacat gatcccccat 9"gtgcaaa 8520 
aaaqcgSta gclccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 8580 
tcaltlKqS ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 8640 
ttttctgtgl ctgg?gag?a ctclaccaag tcattctgag aatagtgtat 9=9^9accg 8700 
agttgc?c?t gcclggcgtc aacacgggat aataccgcgc cacatagcag aa tt aaaa 8760 
g?gc?catca Itggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 8820 
laltccaqtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 8880 
accaqcKt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 8940 
qcgacacgga aaStgalt actcatactc ttcctttttc aatattattg aagcatttat 9000 
caSgtSt gtc?ca?gag cggatacata tttgaatgta tttagaaaaa taaacaaata 9060 
ggggttccgc gcacattEcc ccgaaaagtg ccacctgacg tctaagaaac "ttattatc 9120 
Sacattla cctataaaaa taggcgtatc acgaggccct "cgtcttca agaattaatt 9180 
ctcatgtttg acagcttatc atcgataagc tgactcatgt tggtattgtg aaatagacgc 924 0 
agatcgggaa cactgaaaaa taacagttat tattcg 



<210> 14 
<211> 11 
<212> PRT 

<213> Pichia pastoris 
<400> 14 

Lys Arg He Ala Ser Gly Val He Ala Pro Lys 
1 5 10 

<210> 15 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 15 

tgggtcttgg cacctgccag ctgctttctg 

<210> 16 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 16 

gaagggggaa gtggtgctgg gaccctag 

<210> 17 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 17 

ccctctgggt agccagcaca cagcatc 

<210> 18 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 18 

gccatcgtgg tgccggccaa ctacag 

<210> 19 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 19 

atcgtggtgc cggccaacta cagccaagtg 

<210> 20 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 20 

acccatcacc tgctcccgta tccatgcctc 

<210> 21 
<211> 103 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Polypeptide 



Va^Ser'Arg Leu Val He Ser He Arg Leu Pro Gin His Leu Gly Leu 
Arg Pro Pro Leu Ala Leu Leu Glu Leu Ser Ser Arg Val Glu Pro Ser 
Pro Ser Ala Leu Pro He Cys Leu His Pro Ala Gly lie Pro Pro Gly 
Ala Ser Cys Trp Val Leu Gly Trp Lys Glu Pro Gin Asp Arg Val Pro 
Val A°ia Ala Ala Val Ser lie Leu Thr Gin Arg He Cys Asp Cys Leu 
Tyr Gin Gly He Leu Pro Pro Gly Thr Leu Cys Val Leu Tyr Ala Glu 



85 

Gly Gin Glu Asn Arg Cys Glu 
100 



<210> 22 
<211> 37 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Polypeptide 



A 4 sn°Asp 2 Ser Ar g Trp Ser Leu Leu Cys Gin Glu Glu Gly Thr Trp Phe 
L eu Ala Gly He Arg Asp Phe Pro Ser Gly Cys Leu Arg Pro Arg Ala 



20 25 



Phe Phe Pro Leu Gin 

35 
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